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Abstract 

There has been some confusion concerning the animal group-size: an exponential 
distribution was deduced by maximizing the entropy; lognormal distributions were 
practically used; a power-law decay with exponent 3/2 was proposed in physical 
analogy to aerosol condensation. Here I show that the animal group-size distribution 
follows a power-law decay with exponent 1, and is truncated at a cut-off size which 
is the expected size of the groups an arbitrary individual engages in. An elementary 
model of animal aggregation based on binary splitting and coalescing on contingent 
encounter is presented. The model predicted size distribution holds for various data 
from pelagic fishes and mammalian herbivores in the wild. 
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1 Introduction 



It is hard to find animals in nature that do not aggregate (school, herd, swarm, 
or flock). Despite the universality of aggregation and its ecological and eco- 
nomic importance for the estimation of wildlife abundance, the statistical ques- 
tion about animal group-size has been involved in much confusion. Theoret- 
ically the Gibbs-Boltzmann (exponential) distribution has been proposed for 
the animal gr oup-size distribution by applying a maximum entropy principle 
, whereas practica lly the lognormal distribution has been used in 



fisheries ([Matsuishi et a/I 11993). Only l ately, it has been suggested t hat power- 
l aw di stributions may be quite generic ((Bonabeau fc Dagorrl 1995f ) . Anderson 



(|198l|), a pioneer in statistical research into group formation, proposed a 
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stochastic dynamic equation for the size of fish school, in which the possibility 



of pow er laws was already presented but not exploited. iBonabeau fc Dagorn 



(1995) Dresented a mod el of animal aggr egation inspired by a physical model 



of particle aggregation ( Takavasu . 19891 ). and predicted that the group-size 



distributions follow a power-law decay with exponent 3/2. The empirically 
determined power-law size distributions of fish schools have exponents in the 
range from 0.7 to 1.8 (Niwa, 1998; Bonabeau et ai, 1999). 

iV animals moving together form an iV-sized group. The group-size distribution 
W(N) is proportional to the observed number of iV-sized groups. Power-law 
distributions, W(N) oc N' 13 , have somewhat unusual properties. They do not 
have a well-defined mean when (3 < 2. Fat-tailed group-size distributions are 
necessarily truncated at a cut-off size because the population is finite, but 
truncated power law must be distinguished from purely rapidly decreasing 
ones, as they exhibit specific properties, e.g., a violation of the central-limit 
theorem (Mantegna & Stanley, 1995; Axtell, 2001). In order to investigate 
the possibility of power-law regimen in the group-size distribution, let us re- 
consider some existing data (Table 1) in terms of the population distribution 
P(N) = NW(N), which is proportional to the ratio of animal population in 
iV-sized groups to total population. 

Suppose that we have a data set arranged in a frequency distribution with n 
classes. Let Wi AN be the frequency of animal groups observed in the i-th class 
(where AN is the class width); the class mark of group size, iVj, is the mid- 
point of the class. Wi then represents the distribution density of group sizes. 
The number of individuals associated with groups of the i-th class is given 
by NiWiAN (denoted by Pi AN). Pi then represents the distribution den- 
sity of population. Shown in Fig.l is a semilogarithmic plot of the population 
distribution of pelagic fishes. Surprisingly, I find that the scaled population 
distributions appear to collapse onto a single curve, an exponentially decreas- 
ing function. It implies that (i) the population distribution has a well-defined 
mean 



(N) P = '£N i P i 




which is hereafter referred to as P-mean; (ii) the school-size distribution dis- 
plays robust scale-invariant behavior with the power- law index (3 = 1; and 
(iii) the power-law distribution of school sizes is truncated at the cut-off size 
equal to P-mean. P-mean is the average of the population distribution among 
group sizes, i.e., the expected size of the groups in which an arbitrary in- 
dividual engages. The population distribution is the crossover function from 
power-law to exponential decay. 
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2 The Model 



The empirical finding for fish school size suggests that the population distribu- 
tions among school sizes are identical for fishes regardless of large diversities in 
behavioral and ecological conditions. I propose a simple stochastic-differential- 
equation model accounting for group-size statistics. Assume binary splitting 
independent of group size (breakup rate p) and coalescing on contingent en- 
counter. Trace the size change of the group that a certain individual (named 
"A") rides. The group size at time t is denoted by N A (t). A discrete countable 
number of group size is replaced by a continuous group size, providing that 
there is a sufficiently large reservoir of individuals. Rewriting the difference 
(the size change AAa for a finite time interval At) to the differential of the 
group size for an infinitesimal time interval dt, 

AN A (t) = A A (t + At) - N A (t) -> dN A (t) (2) 

gives the Ito stochastic differential equation for group size: 

dN A (t) = -| (N A (t) - (N) P ) dt + a (N A (t)) dB(t) (3) 

where dB is Wiener noise term of normal distribution with zero mean and 
variance dt, and a (N A ) is the standard deviation of group-size changes. Fluc- 
tuations of the group size in aggregation-breakup processes, a(N A ), are nu- 
merically determined as follows: 

Assume for simulations that there are s sites, coarse-grained zones of space, on 
which $ individuals move. The number of individuals, $, is conserved. Each 
site is either empty or occupied by one group. At each discrete time step, all 
groups move towards a randomly selected site. They may move to any site 
with equal probability. This corresponds to the mean-field theory. When M- 
and A-sized groups happen to move to the same site, they aggregate to form 
an (M + A)-sized group. Each group with a size greater than or equal to 
2 splits into a pair of groups with the probability p at each time step. It is 
assumed that the probability p for a group to split is independent of its size 
and that the sizes of splitting groups are uniformly distributed: a probability 
for an A-sized group to split into M- and (A — M)-sized groups is represented 
by 

K h (N\M, N — M) — K h (N) = (4) 

for A > 2. Then, the expected decrement and increment of size of the group 
that a certain individual "A" rides are pN A /2 and <E>/s (i.e., coarse-grained 
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spatial population density) at each time step, respectively. Monte-Carlo simu- 
lations of animal-group aggregation show that fluctuations of the group size in 
aggregation-breakup processes exponentially depend on the size (Fig.2), and 



give 



a 2 (N) = 2Dexp 



N 



(5) 



where D denotes the size of fluctuations. The same results can be obtained 
with another model for a group to split: the size distribution of splitting groups 
is binomial. 



K b (N\M,N - M) 




(6) 



Let the domain of the size N be real numbers. The aggregation-breakup dy- 
namics [eqn (3)] is sup posed to b e sym metric about iV = 0. By performing 
the change of variable (jRichmondl . 2001 ) 



dx 



p/2 



dN a(N) 



(7) 



eqn (3) becomes the Langevin equation for a Brownian particle moving in a 
potential U(x): 



dx(t) = - d -^dt + f-dB{t) 



(8) 



where 



U(x) = - 




P(N) 2 P 
D 



x 



(N)py/p/D, 



(9) 



The induced potential is schematically depicted in Fig. 3. If the potential U(x) 
is continuously differentiable, the following relation is obtained: 



p(N) 



D 



(10) 



Therefore the breakup rate p, which corresponds to the dissipation in the 
stochastic differential equation, and the noise D cannot be independent. The 
numerical results indicate that this fluctuation-dissipation relation is already 
satisfied (Fig.2). 
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3 Results 



The stationary solution of eqn (3) reads the probability for individual "A" 
to be found in iV-sized groups, which is proportional to the stationary popu- 
lation distribution among group sizes, P(N). The distribution P(N) has the 
desired form and the stationary group-size distribution W(N) [= N~ 1 P(N)] 
is rigorously written as 



W(N) oc N' 1 exp 



N 



1 - 



3 -N/(N)i 



where eqn (5) is adopted for the variance with eqn (10). There is no parameter 
to be adjusted, since P-mean is obtained directly from the data [because of the 
fluctuation-dissipation relation, eqn (10), parameters p and D do not explicitly 
appear in the group-size distribution]. The self-consistency in the average of 
population distribution 



q cxp 



-q (l - e- 9 /2)] dq 



J exp [-q (l - e~V 2 )] d <? 



(12) 



is numerically certified by using 2 8 -digit precision for internal computations, 
where q = N/(N) P . 

The model accounting for group-size statistics presented here is consistent 
with a number of numerical experiments. Fig.4 represents the group-size dis- 
tribution obtained from simulations. The model is tested with empirical data 
for the fish school-size distributions and a close agreement is found (Fig.5). 
The normalization factor is given by 



exp 



-q 1 



/2) dq 



0.881237 



(13) 



The size distribution W(N) follows a power law with exponent f3 — 1 to a 
cut-off size {N)p, i.e., a well-defined mean of the associated distribution P(N) 
which is exponentially decaying function. Naturally, a well-defined mean does 
not exist for the group-size distribution. Moreover, a coarse-grained description 
of the group aggregation suggests that P-mean varies in proportion as the 
number of individuals or the spatial population density p, and inversely as the 
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breakup rate: 

(N) P oc P - (14) 
V 

which is numerically approved (Fig. 6). The group-size distribution, therefore, 
follows a universal scaling law, i.e., it decays as a power law with an exponential 
truncation controlled not only by the system size (number of individuals) but 
also the breakup rate. I indeed find a collapse of the empirical data onto a 
single curve as well as the collapse of the numerical results. Eqn (14) is utilized 
for data normalization (Table 1). 



4 Discussion 



I have reconsidered some existing data on pelagic fishes and found identical 
statistical signatures: the school-size distribution follows a power-law decay 
with the exponent 1 up to a cut-off size which is given by P-mean. The ra- 
tionale behind this robust scaling invariance is the exponential fluctuation of 
school-size change. I have proposed an Ito stochastic differential equation gov- 
erning the evolution of the group size that a certain individual rides, which is 
quite different approach from Anderson's (1981) model based on a stochastic 
dynamics for the change of the number of individuals in a certain group. The 
model predicts not only the power-law behavior observed in nature, but also 
the deviation from pure power-law towards exponential decay. The similar- 
ity between the empirical distribution and the distribution obtained from the 
model is striking, though there is no fitting parameter. The exponential fluctu- 
ation of group size is the essential ingredient of underlying aggregate-breakup 
dynamics that influence animal group size. If the size fluctuation cr 2 (N) were 
proportional to the group size N, the group-size distri bution would follow a 



power -law decay with exponent 3/2, as was suggested bv lBonabeau h Dagorn 

(1223. 



The interacting group systems introduced here have a marked feature in con- 
trast to physical systems in thermal equilibrium: the aggregate-breakup pro- 
cess i n anim a l-grou p systems does not allow for detailed balance as below; 
while Okubol (|l986| ) made the detailed balance assumptions in order to obtain 



the group-size distribution. 

The Smoluchowski rate equation is the alternative equation governing the 
group-size distribution (Gueron & Levin, 1995; Gueron, 1998; Durrett et al, 
1999), and is equivalent to the follo wing master equation for associated dis- 
tribution of population ()Niwa . 1998| ): 
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P(N,t + At) - P(N,t) 

= -P(N, t) rn(N, M;t) + J2 P ( M - t)m{M, N; t) (15) 

M=l M=l 

with the transition probability for a specific individual which engages in a 
group of size N to pass into a group of size M in a finite time interval At: 



m(N,M;t) 

= A" b (JV|M, N - M)f + P(M - *,«) *-<Y-~/ W (16) 

where -fT a denotes the probability for an iV-sized group to join with an M- 
sized group in a finite time interval At. Whenever two groups meet they 
are supposed to join, which implies Kg, = s^ 1 for a coarse-grained model 
of animal-group system. Assume the detailed balance condition: there are as 
many transitions per At from M-sized groups to iV-sized groups as from N 
to M by the inverse process. The following recurrence formula for stationary 
solution is then obtained [K\, is given by eqn (4)] 

P(l) 

P(N) = P(N - 1)N^^- (17) 

sp 



which leads to an explicit solution for group-size distribution 
W(l) 



W(N) 



where ttjv,^ is the Poisson distribution with mean ji = sp/W(l). The result 
contradicts the simulated group-size distribution; besides, eqn (18) violates 
the conservation law of population. The detailed balance conditions therefore 
do not hold in the aggregate-breakup systems, or else we may have another 
deterministic equation which is consistent with micro-reversibility. 

The model developed here can apply to a wide spectrum of cases on animal 
species. Fig. 7 shows the herd-size distributions for mammalian herbivores; 
these distributions are much bet ter fitte d by a power law than by an exponen- 
tial decay, as was suggested by Okubol (1986). We may however notice that 



the empirical data are not in perfect agreement with eqn (11). It may be ex- 
plained by biases in the data. Another possibility bringing a subtle difference 
between data and theory is that the uniform-breakup assumption (splitting 
probability independent of group size) is no longer valid. 

There is yet another possibility. This subtle difference may be caused by the 
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fluctuation-dissipation relation reduced above. It can be generalized as follows: 
p(N) 2 P = x*D (19) 



where x$ > 1, and potential U(x) is no longer differentiable at x = but 
continuous (Fig. 3). The population distribution among group sizes is then 
modified into 



P(N) (x exp 



N 



P(N) 
2D 



2 ^ 
P e -N/{N) P 



(20) 



which guarantees the self-consistency in the average of distribution as well as 
eqn (12). The modified size distribution by adopting eqn (19) matches better 
the data. Yet, the modified distribution [e qn (20)1 can apply to schools of 
sardinellas caught in the up- welling areas (|Bonabeau et all Il999l ). and the 
agreement between the empirical distribution and the model's prediction is 
remarkable (Fig. 8). 
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Figure legends 



Fig.l Empirical population distribution of pelagic fishes, as shown in Table 1 
(except sardinellas) . The scaled distributions Pi(N)p are plotted against the 
scaled school sizes Ni/(N)p. The data are clearly fitted by an exponential 
decay suggesting that the distributions are identical, e.g., exp (— N/(N)p) 
(broken line). The proposed model is somewhat more complicated (solid 
line) . 

Fig2 Exponential fluctuations of group-size change (numerical results). The ordi- 
nate is the variance of size change of the group, ((AN A — (AN A ) jy) 2 ^ , for 

a finite time interval (single time step of simulation run), divided by p(N) P , 
where (AN\) N denotes the expected value of the size change AN a for a 
finite time interval, providing that individual "A" rides an iV-sized group. 
The abscissa is the group size N divided by P-mean. Simulations show 
that the group sizes exponentially fluctuate [solid line represents <r 2 (iV) oc 
exp (N/(N)p)}. For the simulation parameters, consult Table 2. Note that 
the variance is scaled based on the fluctuation-dissipation relation [described 
later; eqn (10)]. 

Fig.3 Potential induced by the change of variable, (a) Potential U(x) is continu- 
ously differentiable (solid line). The ordinate is the potential U(x) divided 
by the breakup rate p. The abscissa is the reduced size x. P-mean {N)p 
takes positive values for x > and negative values for x < 0. The reduced 
domain of x is (—1, 1). If potential U(x) is continuous (broken line showing 
U/p versus xj \Jp(N) P /D), the fluctuation-dissipation relation is general- 
ized into eqn (19). (b) The group size N as a function of the reduced size 
x. The ordinate is the scaled group size N/ (N) P . 

Fig.4 Simulated group-size distribution. The scaled distributions Wn{N)p are 
plotted against the scaled group sizes N/(N)p. The scaled distributions 
collapse onto a single curve that corresponds to eqn (11) with normalization 
factor eqn (13). The simulation parameters are summarized in Table 2. 

Fig. 5 Empirical school-size distribution of pelagic fishes (the same data sets as 
Fig.l). The scaled distributions W,i(N)p are plotted against the scaled 
school sizes Ni/(N) P . The scaled data collapse onto a single curve that 
corresponds to eqn (11) with normalization factor eqn (13). 

Fig. 6 Plot of P-mean against the scaled population size Q/sp, i.e., the ratio of 
population density to breakup probability (numerical results). Parameters 
are summarized in Table 2. The broken line is a fit with slope 2.907 ±0.017. 

Fig. 7 Empirical herd-size distribution of mammalian herbivores, as shown in Ta- 
ble 1 (scaled size distributions). The solid and broken lines correspond to 
eqn (11) with normalization factor [eqn (13)] and to a least-squares fit of 



10 



the modified size distribution with x\ = 7.96 ± 1.61, respectively. 

Fig. 8 Two kinds of empirical size distributions of sardinellas schools, as shown 
in Table 1 (scaled size distributions), (a) Population distribution. A least- 
squares fit of the data to eqn (20) with = 14. Mil. 95 is shown, (b) School- 
size distribution. The solid and broken lines correspond to eqn (11) with 
normalization factor [eqn (13)] and to a fit of the modified school-size dis- 
tribution, respectively. 
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Figures 

Fig.l Empirical population distribution of pelagic fishes 

Fig. 2 Exponential fluctuations of group-size change 

Fig. 3 Potential induced by the change of variable 

Fig.4 Simulated group-size distribution 

Fig. 5 Empirical school-size distribution of pelagic fishes 

Fig. 6 Plot of P-mean against the scaled population size 

Fig. 7 Empirical herd-size distribution of mammalian herbivores 

Fig. 8 Two kinds of empirical size distributions of sardinellas schools 
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Table 1 

Species analyzed. The ways of estimating school sizes of pelagic fishes were catch 
per set by a purse seine (in tons) or acoustic surveys (vertical cross-section, vertical 
thickness or diameter of a school were observed) . Acoustic-survey data are expressed 
in dimensional size of a school, which can be reduced to the biomass in a school 
(Squire, 1978; Anderson, 1981; Misund, 1993; Niwa, 1995; Misund & Coetzee, 2000): 
the school biomass is proportional to the cross-section, the square of thickness or the 
square of diameter of a school. The data (distribution density) are given by the set 
{ (iVj, Wj) | i = 1, 2, . . . , n}. W{AN reads the frequency of group sizes which lie within 
the i-th class [Ni - AN/2, Ni + AN/2), where AN denotes the class width and the 
i-th class mark is given by Ni = (i — 0.5) AN. Data (biomass or reduced number in 
a group) are normalized as Y17=l NiWiAN = (N)p, where P-mean (N)p is defined 



by eqn (1). The normalization is based on 


the linear dependence of P-mean on the 


population size [described later; eqn (14), 


Fig.6]. 




Species 


P-mean* 


Data sources 


pelagic fishes 






□ Tropical tuna,^ Data from fisheries 


11.69 


Bonabeau & Daeorn (1995) 


Tropical tuna,^ Data from fisheries 


4.80 


Bonabcau et al. f 1999) 


in the vicinity of fish aggregating objects 




A Engraulis mordax} Acoustic survey 


15.67 


Smith (1970). 






cited in Anderson (1981) 


• Sardinops melanosticta 


6.45-17.36 


Hara (1983, 1985, 1986) 


6 acoustic surveys 






+ Clupea harengus 


7.24-10.88 


Reid et al (2000) 


4 acoustic surveys 






o Sardinellas aurita and S. maderensis 40.73 


Bonabeau et al. f 1999) 


Catch in the up-welling areas 






terrestrial herbivores^ 






♦ Gazella thomsoni 


5.04 


Wirtz & Ldrscher f 19831 


Redunca redunca 


3.22 


Wirtz & Ldrscher (1983) 


■ Kobus ellipsiprymnus 


5.22 


Wirtz & Ldrscher 1983) 


□ Bison bison 


5.56 


Lott & Minta 1983) 


k Syncerus caffer 


5.35 


Sinclair 1977) 


A Ovis canadensis 


9.53 


Hansen (-\9$Q) 



*A unit size is (N\ + N2)/2, which may contain a certain number of individuals. 
^Three species (Thunnus albacares, Katsuwonus pelamis, and Thunnus obesus) 
are mixed. 

"'■possibly including Trachurus symmetricus, Sarda chiliensis, Scomber japonicus, 

and Sardinops sagax. 

§Data are cited in Qkubo (1986). 
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Table 2 

Parameters used in simulations. Simulations have been performed with the 
coarse-grained zones of s = 2 18 sites, simulation run = 2 21 time steps, and param- 
eters summarized in the table. The splitting probability at each time step of sim- 
ulation run is defined by eqn (4). The frequency of the amount of iV-sized groups 
(N = 1, 2, ...,$) at the last of simulation run is represented by Wn in Fig.4, where 
the frequency distribution is normalized as X^jv=i NWn = (N) p. P-mean sizes were 
computed from simulation results. 

breakup probability p population P-mean 



♦ 0.02 2 14 9.00 
■ 0.02 2 15 17.54 
▲ 0.02 2 16 37.52 

• 0.02 2 17 73.80 

* 0.02 2 18 147.27 
0l 2 Tf 143.03 
□ 0.03 2 17 47.01 
A 0.04 2 17 36.77 
o 0.1 2 17 14.14 
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